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Abstract 

It has been generally believed that the multiple-input multiple-output (MIMO) channel capacity grows linearly with the size of antenna arrays. 
In terms of degrees of freedom, linear transmit and receive arrays of length L in a scattering environment of total angular spread |Q| asymptotically 
have |f2|L degrees of freedom. In this paper, it is claimed that the linear increase in degrees of freedom may not be attained when scattered 
electromagnetic fields in the underlying scattering environment are statistically correlated. After introducing a model of correlated scattering, which 
is referred to as the colored scattering model, we derive the number of degrees of freedom. Unlike the uncorrelated case, the number of degrees 
of freedom in the colored scattering channel is asymptotically limited by \Q\ ■ min{L, 1/r}, where T is a parameter determining the extent of 
correlation. In other words, for very large arrays in the colored scattering environment, degrees of freedom can get saturated to an intrinsic limit 
rather than increasing linearly with the array size. 

Index Terms 

MIMO systems, antenna arrays, channel capacity, degrees of freedom, physical channel modeling, scattering. 

I. Introduction 

From the time when it was revealed that the multiple-input multiple-output (MIMO) systems promise significant gains, which had never 
been achievable with any advanced single-antenna systems, in the spectral efficiency and the reliability of communication systems, the MIMO 
systems have been intensively studied in many different perspectives |1|-|16|. In recent years, along with the advances in implementation 
technologies, the implementation and deployment of MIMO systems have become active research topics. For example, advanced space-time 
coding techniques are being adopted by the standards of modern communication systems, such as high-throughput wireless LAN (IEEE 
802.1 In) and 3GPP Long Term Evolution. In particular, very large MIMO systems |18|, also known as massive MIMO, are currently being 
studied to meet the increasing needs for high-efficiency, green communication systems. Basically, most of these MIMO technologies are 
based on the premise that the capacity and diversity gains are improved with the increasing number of antennas. 

In a number of early studies on the MIMO fading channel, the rich scattering environment was often assumed and the channel was modeled 
as a random matrix with independent and identically distributed (i.i.d.) elements, each of which represents the fading channel coefficient 
between transmit and receive antennas. In the landmark work by Telatar (TJ, it was shown that, for the MIMO channel with i.i.d. fading, 
the channel capacity grows linearly with the numbers of the transmit and receive antennas. In much subsequent work followed, the i.i.d. 
MIMO channel was thoroughly studied and lots of insight on the MIMO system was gained. In particular, many space-time coding schemes 
were designed based on the understanding of i.i.d. MIMO channel model [19|. However, in the real applications, the i.i.d. MIMO channel 
model has very limited importance because the scattering environment is rarely rich enough and the channel coefficients are correlated |2|, 
(3). To address the requirements for more realistic MIMO channel models, the Kronecker model was introduced |3|-|5|. In this model, 
correlation between channel coefficients is posed by multiplying transmit and receive correlation matrices on the right and left of a matrix 
of i.i.d. elements. Things become more complicated than the i.i.d. case, but it was shown in [ 5 1 that the capacity of the Kronecker model 
still grows in proportion to the number of transmit and receive antennas. That is, the correlation introduced in the Kronecker model only 
impacts the rate of growth of the capacity, not the linearity itself. Though the Kronecker model facilitates the capacity analysis through the 
powerful random matrix theory, it fails to capture the dependence between the transmit and receive correlation characteristics and has also 
proven to be inconsistent with measurement results |4|. Recently, efforts were made to incorporate more physical natures of scattering into 
channel modeling. In the literature related to physical channel modeling | 6|-(9). it was stated that, while the transmit and receive signals can 
be described by the excitation current distribution on antenna arrays (array domain), the propagation environment can be better explained 
in the angular domain in terms of the strength of the electromagnetic fields radiated from or impinging on the arrays. The two domains 
are convertible to each other through an appropriate transform determined by the geometry of the arrays. In particular, for linear arrays, the 
mapping between them is given by the Fourier transform, and there exists an analogy between the time-frequency domain and angle-array 
domain pairs. Given spatial distributions of scatterers, the electromagnetic fields at the transmit and receive sides are related in the angular 
domain through the Green function, and the joint response between them, in terms of the angles of arrival and departure, is given by a 
scattering response function (see Fig. [T). In 1 10 1, [11 1, simplified angular domain channel models were presented based on discrete antennas 
and discrete scattered paths. Though many insightful observations were made from the discrete path modeling, it is less realistic because 
the scattered paths are often continuously dispersed and clustered in many practical situations such as urban and indoor environments. Also, 
for the angular domain channel model, it was shown in [ 12|-| 14 1 that putting more antennas in a fixed array aperture does not always bring 
increasing gains due to increased correlation between channel coefficients. Instead, it was observed that the size of the array aperture is 
more important than the number of antennas and, thus, in |9|, 1 15], 1 16], the notion of a continuous array, which corresponds to the infinite 
number of infinitesimally-spaced antennas packed in a given aperture, was adopted. Based on this model, the intrinsic channel characteristics 
independent of the antenna configuration, such as the capacity and the diversity, were derived as functions of the aperture size and the total 
angular spread of scattering clusters. In particular, in [16], three different mechanisms of scattering, i.e., specular reflection, single-bounce 
diffuse scattering, and multi-bounce diffuse scattering, were introduced and their individual impacts were analyzed. 

In this paper, we focus on another intrinsic characteristic of the channel, which has not been addressed in existing literature; the correlation 
in the scattering response function. The clusters of scatteres are often composed of a large but finite number of scattering objects. Though 
the scattered fields by a single object can be statistically correlated 1171 . 1311 , due to the limited spatial resolution of the arrays, only 
combined and smoothed effects of the fields that are scattered by multiple scatterers in the resolvable angle are observed as shown in Fig. 
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Scatterer clusters 




Fig. 1. Clustered scattering model. 




(a) Low-resolution array (b) High-resolution array 



Fig. 2. Responses of low- and high-resolution arrays for the same scatterers. 



|2(a)| Therefore, it is reasonable to assume that the scattering response is given by an uncorrelated (or white) Gaussian random process 
in the angular domain as in 1131 , 111 61 , based on the central limit theorem. However, when we are concerned with a very large array, the 
spatial resolution can be so high that only a few objects falls into the resolvable angle as in Fig. |2(b)| In such a case, the uncorrelated 
scattering model seems no longer plausible and some correlation characteristics may need to be appended to the conventional model. For this 
purpose, we consider a simple correlated scattering model based on the notion of bandlimited white Gaussian random process. For the sake 
of distinction, we refer to the conventional uncorrelated scattering model in 1 16] as the white scattering model, and our correlated scattering 
model as the colored scattering model. 

In Section [TTJ we first introduce the colored scattering model and justify it compared to the conventional white scattering model. 
Assuming multi-bounce diffuse scattering, Section [Hi] provides a canonical representation for the colored scattering response by the spectral 
decomposition technique. Also, Section ITVl shows how the channel is divided into subchannels with linear transmit and receive arrays. The 
key tools used in Sections Unl and HVl are the Karhunen-Loeve theorem 1321 and prolate spheroidal wave functions [23 ]. They have been 
widely used in the analysis of time- and bandlimited signals, either deterministic or random, and can be directly applied to the analysis of 
the scattering response due to the analogy. That is, the counterparts of time- and bandlimited signals in the time-frequency domains are the 
array aperture- or correlation-, and angular spread-limited responses in the angle-array domains. Based on the decomposed subchannel model 
and assuming the fast-fading channel, the ergodic capacity is derived. It is shown that, for a given signal-to-noise ratio (SNR), the pre-log 
factor of the ergodic capacity, which is also known as the number of degrees of freedom, is bounded by the product of the array length 
and total angular spread. This is consistent with the previous result for the white scattering model in |16|. However, unlike the previous 
case, an interesting observation is made for the colored scattering model that the number of degrees of freedom is saturated to a certain 
value determined by the correlation characteristic of the channel. Similarly in 1101 , it has already been shown that the degrees of freedom is 
saturated to the number of discrete and independent propagation paths in the channel. However, there is a fundamental difference between the 
two saturation phenomena since the scattering response is assumed to be clustered rather than discrete in this paper. To verify the theoretic 
result derived in Section Hvl a MIMO system with discrete antennas is considered in Section [Vj and its capacity bounds are numerically 
computed in the colored scattering environment. The numerical results show that the capacity bounds do not linearly increase with the array 
size but eventually get saturated to certain values. Finally, Section [VTl discusses extensions of the result of this paper. Though multi-bounce 
diffuse scattering, linear arrays, and fast-fading are mainly assumed in the analysis of this paper, some extensions to the other situations, 
such as single-bounce diffuse scattering and slow-fading, are straightforward. Also, some concluding remarks are given in Section ["VTl 

The following notations are used throughout this paper. The Dirac delta function is denoted by S(-) and, for integers mi and ni2, the 
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Kronecker delta is denoted by S mi ,m 2 - sine a; denotes the sine function (sin7ra;)/(7ra;). Vectors or matrices are denoted by boldface symbols 
and the superscripts *, T, and H denote conjugate, transpose, and conjugate transpose, respectively. The determinant and trace of a square 
matrix A are denoted by det[A] and Tr(A), respectively, and I is the identity matrix of the appropriate size. The sets of real and complex 
numbers are denoted by R and C, respectively. For real numbers a and b (a < 6), [a, b] is the closed interval {a; G M|o < x < b}. The 
statistical expectation is denoted by E{-} and, for a set .4, \A\ denotes the Lebesgue measure, log a;, with no indicated base, denotes the 
logarithm of base 2 and the natural logarithm is denoted by In a;. Finally, o(-) is the standard little-o notation. 



II. Scattering channel and array models 

To focus on the intrinsic characteristics of the channel, we consider the continuous array model as in |9|, |15], |16|, which yields results 
not depending on the array configuration, such as the number and positions of the antennas. Thus, the conventional discrete antenna model 
can be seen as a sampled version of the continuous array model. More specifically, we assume linear arrays of length Lt and L r , which 
are normalized to the wavelength, at the transmitter and the receiver, respectively. At the transmitter and the receiver, we indicate specific 
positions on the arrays by scalar coordinate variables p G [— L t /2, Lt/2] and q G [— L r /2, L r /2], respectively. Then, for a narrow band 
transmit signal, which is represented by a current distribution x(p) on the transmit array, the received signal, which is also represented by 
a current distribution y(q) on the receive array, is given by 

J/(<?) = / c(q,p)x(p)dp+ z(q), (1) 
J -Lt/2 

where c(q,p) is the channel response and z(q) is the zero-mean white Gaussian noise process with an autocorrelation function (ACF) 
E {z(qi)z*(q2)} = a 2 5(qi — q2) for qi,q2 G [— L r /2, L r /2^\. At the transmitter, the transmit power constraint is imposed as 

E{ f Lt/2 \av>)\",li> \ - />. .2! 



f \x(p)\ 2 dp\ < P 

J-L t /2 J 



Physically, the channel response is modeled as a cascade of the transmit array response a t (a,p), the scattering response h(/3,a), and the 
receive array response a r (/3,q), i.e. 

poo poo 

e(<?,p) = / / a*(/3,q)h(P,a)a t (a,p)dadp, (3) 



— oo •/ — oo 



where a and /3 are directional cosines for the transmit and receive arrays, respectively, as shown in Fig.Q] Therefore, h(/3, a) is physically 
meaningful only for a G [—1,1] and /3 G [—1,1] and vanishes for other values of a and j3. In far-field scattering, where the scatterers are 
sufficiently far from both the transmitter and the receiver, the array responses of the linear arrays are given by 

a t (a,p) = e - S27 " xp , (4) 
a r {P,q) = e-^". (5) 

Note that, a and ft are variables in the angular domain, while p and q are variables in the array domain. As mentioned in the introduction, 
the main benefit of assuming linear arrays is that the mapping between the array and angular domains is done by the Fourier transform, 
which greatly simplifies the analysis. Hence, an equivalent angular domain representation of the relationship in (0 is given by 

/oo poo poo 
/ L r sinc (L r (P-f3')) h(P',a) I L t sine (L t (a - a)) X(a)dada'dp' + Z(J3), (6) 
- oo J — oo J — oo 

where X(a), Y(j3), and Z(f3) are Fourier transform pairs of x(p), y(q), and z(q), respectively. That is, 



and 



rL t /2 

X(a) = / x(p)e~ ]2llav dp, (7) 

■) -L t /2 

Y(P) = 2 y(q)e-^^dq, (8) 

J -L r /2 

Z(P) = f Lr/2 z(q)e~> 2 ^dq. (9) 

J -L r /2 

In we can observe the convolution operations with L r sine (L r p) and L t sine (L t a), which operate as low-pass filters. In other words, 
the transmit and received signals are smoothed in the angular domain due to the finite apertures of the arrays. As noted in the literature II II . 
1331 Ch. 7], those smoothing operations are closely related to the notion of spatial resolution of the arrays. As the array sizes Lt and L r 
grows, the smoothing functions have narrower main lobes, which leads to the higher resolution for paths in the angular domain. Note that the 
convolution between Lt sine (Lta) and X(a) in ((6) is redundant because X(a) in is smoothed by construction due to the finite-length 
transmit array. Nevertheless, we will not leave it out for later convenience. 

Now, let us establish the characteristics of the scattering response h(/3,a). In the clustered scattering environment, the scattering response 
is modeled as a continuous random process. The support of the scattering response, on which the response may have nonzero values, is 
determined by the placement of scattering clusters. As shown in Fig. Q] we suppose that there are Alt and M r disjoint clusters seen at the 

'For consistency's sake, we try to follow the same signal model and reuse the notation as in (9), 1161 - 
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transmit and receive arrays, respectively. At the transmit side, the angular subintervals of a subtended by the scattering clusters are denoted 
by fi(,i, i = 1, . . . , M t , and the union of the subintervals is defined as fi t = U^fi^i. Similarly, at the receive side, the angular subintervals 
of /3 are denoted by fi r ,;, i — 1, . . . , M r , and the union of the subinterval is defined as fi r = ufi^fir,;. In |16|, it was shown that the 
statistical characteristics of the scattering response depend on the scattering mechanisms, such as specular reflection, single-bounce diffuse 
scattering, and multi-bounce diffuse scattering. In particular, the specular reflection is deterministic in nature, and the responses for diffuse 
scattering are assumed to be zero-mean white (uncorrelated) random processes with an 

E{h(p 1 ,ai)h*{p 2 ,a 2 )} = 8(p 1 -p 2 )-5(a 1 -a 2 ), (10) 

where the support is 

(Pk,a k ) eSlrx!!,,^ 1,2, (11) 

for multi-bounce diffuse scattering and 

M t 

(featleU^.xSi^^U, (12) 

for single-bounce diffuse scattering {Mt = Mr). Elsewhere outside of the support, the scattering response is always zero. Though it has been 
quite often subsumed in the literature 1111 . 1131 , 1161 , the validity of the above white scattering model for diffuse scattering is somewhat 
suspicious. In 1171 . 1311 . the correlation functions of electromagnetic fields scattered by random objects or rough surfaces were studied. 
The overall correlation function for a scattering cluster can be characterized by superposing the contributions of the constituent objects. 
However, when there is a huge number of scattering objects in the cluster, finding the exact correlation function for the cluster is almost 
impractical. Moreover, since the spatial resolution of the arrays is limited in practice, only smoothed effects of those scattered fields are 
observable as shown in (|6) and Fig. |2(a)| Therefore, it seems rather acceptable to assume that the scattering response is a white random 
process and the observed channel response is Gaussian distributed with zero-mean by the central limit theorem. In terms of the number of 
degrees of freedom of random processes, white random processes have an infinite number of degrees of freedom [20), |2l|. How many of 
those degrees of freedom can actually be used is solely determined by the spatial resolution of the array. It follows that the number of usable 
degrees of freedom increases linearly with the array sizes (7), (9), 1161 . Depending on the composition of the scattering clusters and the 
array resolution, this may be true to some extent. However, though it could be very large, the number of scattering objects in each cluster is 
finite in practice. Moreover, as we increase the array size, we may reach a point at which the spatial resolution of the array is comparable 
to the angular interval subtended by a single scattering object as shown in Fig. |2(b)| At this point, the correlation of scattered fields can 
be captured by the array and the white scattering model may no longer be reasonable. Instead, as a generalization of the white scattering 
model, we assume that scattering response is a zero-mean propefl Gaussian random process with 

E{h(p 1 ,a 1 )h*(l3 2 ,a 2 )}=±- sine (^^) -r sine (^ff 1 ) , (13) 

where the support is given by J 1 1 b or l !12t depending on whether it is multi-bounce or single-bounce diffuse scattering. The parameters T t 
and r r are referred to as the correlation widths at the transmitter and the receiver, respectively, and determines the extent of correlation 
in the angular domain. In contrast to the ACF of the white random process (Toj, the ACF H3\ corresponds to that of a bandlimited white 
random process 1201 . Note that the ACF J X 3 b approaches to that of the white scattering d 1 Ob as T t and T,- tend to zero. Therefore, the new 
model includes the white scattering model as a special case. In distinction from the white scattering model, we refer to the scattering model 
with the ACF d 1 3 1 as the colored scattering model. For notational simplicity, we denote the ACF d 1 3 fc as 

R h {f3 1 ,f3 2 ,a 1 ,a 2 ) = R r {Pi, p 2 )Rt{a!,a 2 ), (14) 

where 



Rt(a 1 ,a 2 )=r r -E{h(/3,ai)h*(P,a 2 )} 



1 . / c*i - a 2 

sine — (15) 



r t V r t 

for ai,a 2 £ fit and f3 £ fi r in the multi-bounce case and ai,a 2 £ Clt,i and P £ fir.j, i = !,■■■ , Mt, in the single-bounce case, and 



RriPufc) = Tt- E{h(f3 u a)h*{p 2 ,a)} 
1 . (Px~P 2 

1 r 



(16) 



for a 6 fit and Pi, p 2 £ fi r in the multi-bounce case and a £ fit,i and Pi,fi 2 £ fi r ,i, i — 1, • • ■ , Mt, in the single-bounce case. In the 
remainder of this paper, dl51 and d 1 6b are referred to as the transmit and receive ACFs, respectively. 

Remark 1: When there are more than one scattering cluster, it is plausible to assume that the channel responses for different scattering 
clusters are uncorrelated. For example, for ai £ fit,.; and a 2 £ fit,*,, i 7^ k, we should expect Rt(ai,a 2 ) = 0. One shortage of the 

2 In 1161 , scaling of the ACF depending on the propagation distance and the number of bouncing through the path was considered to reflect path loss. 
Though the path loss can reduce the received signal power, it does not affect the number of degrees of freedom of the channel. Therefore, in our analysis, 
we omit the scaling factor. 

3 The definition and properties of proper complex random processes can be found in 1221 . In short, the properness states that the real and imaginary parts 
of the random process are uncorrelated. 
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correlation functions (JT3J and J 1 6b is that the uncorrelatedness between scattering clusters is not accounted for in the multi-bounce case. 
However, when F t and F r are very small, which is generally the case, and the separation between the clusters is large enough, the ACF 
J 1 3 b does not lose its generality and simplifies the analysis, which follows in the next section. 



III. Spectral decomposition of the scattering response 

Through a spectral decomposition with properly chosen basis functions, random processes can be disassembled into uncorrelated random 
variables providing the canonical representation of the underlying process. In particular, for the scattering response whose ACF is factorized 
into the transmit and the receive ACFs as in d 14b . the spectral decomposition can be applied separately to each side giving the notion of 
transmit and receive subspaces. The number of effective subspaces with non-vanishing gains and the way that the transmit and receive 
subspaces are related give us the insight to the intrinsic number of degrees of freedom and the diversity gain of the channel. 

It has been discussed in the literature |23|, |32| that bandlimited white random processes with a bounded time support can be decomposed 
using the prolate spheroidal wave functions (PSWFs) as the basis. From the analogy between the time-frequency domains in waveform 
channels and the angle-array domains in linear arrays, the PSWFs can also be used for the spectral decomposition of the clustered and 
colored scattering response. Thus, in this section, we first introduce some properties of the PSWFs and then study the spectral decomposition 
of the colored scattering response h{j3,a). In this and the following sections, we are particularly interested in the multi-bounce diffuse 
scattering whose support is given by d I lb - The extension to the single-bounce case is discussed in Section [VTI 



A. Required notions and theorems 

Before proceeding further, we first introduce some essential notions and theorems. We denote by C 2 the set of all complex valued functions 
f(t) defined on the real line R and square integrable, i.e. 



r 

■J — : 



\f{t)\ dt < oo. (17) 



Analogously, the set of all complex valued functions f(t) defined on A, which is a compact subset of R, and square integrable on A is 
denoted by C\. Now, let R : A 2 —¥ C be a continuous square integrable Hermitian kernel, i.e. 

R(t,s) = R*(s,t), (t,s) G A 2 . (18) 

Associated with the kernel, a bounded linear self-adjoint operator Ta(R) '■ Ca —¥ L.\ is defined as 

r A {R)\m\ = / R(t,s)f(s)ds, (19) 

J A 

for a function fit) G C\. Things of particular interest to us are the eigenfunctions ip n {t) and eigenvalues A n of the linear operator that 
satisfy 

/ R{t,s)%l> n {s)ds = \ n ip n {t). (20) 

J A 

In the remainder of this paper, we assume that the eigenfunctions and eigenvalues are correspondingly ordered such that Ao > Ai > A2 > • • • 
without loss of generality. Finally, for W > 0, a function f(t) G C 2 is called W -bandlimited if 



Wsinc (W(t-s))f(s)ds = f(t). (21) 

In other words, the Fourier transform of a W^-bandlimited function vanishes outside [— 2^]. Also, the set of all W-bandlimited functions 
is denoted by Bw- 

The following theorem is a minor modification of well-known Mercer's theorem |34| and Karhunen-Loeve theorem |32|, which we refer 
here without proof. 

Theorem 1 (Mercer H34V and Karhunen-Loeve H32V ): For a compact interval A = [a, b] on the real line, let x(t) G C 2 A be a zero-mean 
proper complex random process. The ACF of x(t) is defined as R(t, s) = E {x(t)x* (s)}, (t, s) G A 2 . We denote the sets of eigenfunctions 
and eigenvalues of Ta{R) by {tpn{t)} and {A„}, respectively. If R(t,s) is strictly positive definite, A n > 0, and we can find tp„(t) being 
orthogonal and complete in C\ and 

iprn (t)ipn 2 {t)dt = A ?ll 5 ni ,„ 2 . (22) 



Then the random process x(t) can be expanded in terms of ip n (t) as 

x(t) = Y^x n i/) n (t), t G A, (23) 

n=0 

where the convergence is in the mean-square sensfl and 

x n = -!- / x{t)^* n {t)dt. (24) 
4 The convergence in the mean-square sense is differently stated for deterministic and random signals: for x(t) £ C 2 ^, 



im / 

J-, 



lim 



c(t) - ^2 x n ip n (t) 



2 

dt = 



6 



In addition, the coefficients of the expansion satisfies 



and 



E{x n } = 0, 



E{x 

no i n 1 j n 2 



(25) 



(26) 



Therefore, the Karhunen-Loeve (KL) expansion j23i provides an expansion of the random process in orthogonal basis functions with 
uncorrelated coefficients. For Gaussian random processes, Theorem [T] has stronger properties as stated in the following corollary. 

Corollary 1: When x(t) G C\ is a zero-mean proper complex Gaussian random process with a strictly positive definite ACF R(t,s), 
the convergence of the KL expansion d23t is almost sur^l and the coefficients x„ in i24\ are independent and identically distributed (i.i.d.) 
circular-symmetric Gaussian random variables with zero-mean and unit variance. 

Now the following theorem introduces the PSWFs, which, in conjunction with Theorem [TJ comprise the foundation for the spectral 
decomposition of the scattering response. 

Theorem 2 (Slepian and Pollak [23^): Consider a positive definite kernel on a compact interval A = [a, b], 

R(t,s) = W sine (W(t-s)), (t,s)£A 2 , (27) 

where W > 0. Let {ipn(t)} and {A n } be the sets of eigenfunctions and eigenvalues of Ta(R), respectively. Then, the following properties 
hold. 

1) ip n (t) are real and bandlimited: ip n (t) G Bw- 

2) 4>n{t) are orthonormal on the real line and complete in Bw'- 



f 

j — [ 



tp ni (t)tp n2 (t)dt = 5 ni ,n 2 , ni,m = 0, 1,2, . 
3) ip n (t) are orthogonal on A and complete in C\: 

ip ni {t)il) n2 {t)dt = A ni 5 ni ,n2) ni,na = 0,1,2, 



4) A n are real, positive, and distinct: 1 > Ao > Ai > A2 > 

5) For t G 

R(t, s)^ n (s)ds = X n tp n (t), n = 0, 1, 2, . 



(28) 



(29) 



(30) 



The widely used terminology to refer to tp„(t) in Theorem [2] is the prolate spheroidal wave functions. In the remainder of this paper, we 
denote {ipn(t)} by Va,w, where the parameters are explicitly indicated in the subscript. 

Remark 2: The compact interval A in Theorems [Tj and [2] can be replaced by a finite union of disjoint compact intervals. The extension of 
Theorem[Tjfor the multiple intervals can be found in [25]. Also, the extension of Theorem [2] was initially pursued in |24| and the asymptotic 
distribution of the eigenvalues was derived. The composition of PSWFs for the multiple interval case can be found in 1261 , 1271 . 



B. Spectral decomposition of the scattering response 

For the transmit ACF R t (qi , 02), the sets of eigenfunctions and eigenvalues of Tn t (Rt) are denoted by {ipt,n(oi)} and {A ti „}, respectively. 
Then, from dl5| l and by Theorem[2] ip t ,n{ot) are PSWFs. In multi-bounce diffuse scattering, h(J3, a) has an ACF proportional to R t (a\, 02) 
on il t for a fixed f3 G £l r - By Theorem Q] it can be expanded as 



h(/3, a) = J2 MW*,n(«), a e fit 



(3D 



where 



M/8) 



Note that h n (/3) is a Gaussian random process with 



and 



1 

At,n 



h(/3, a)i> t ,-n{a)do 



E{h n (f3)} = 0, 



(32) 



(33) 



when x(t) is a deterministic signal, and 



lim E 

N— >oo 



x(t) - 22 x ni'ri(t) 
n=0 



: 0, t e A, 



when x(t) is a random process. Both of the statements are standard [32], and, for all expansions in orthogonal functions, the convergence in the mean-square 
sense is tacitly assumed in the remainder of this paper unless otherwise stated. 
5 For every e > 0, 

JV-l 



Pr < lim 

N—too 



x(t) - 22 x ni>n(t) 



< e 



1. 
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£{/i ni (/3i)/i* 2 (/3 2 )} = t ~r / / E {h(/3 1 ,a 1 )h*(l32,a2)}i>t,n 1 (cei)ipt,n2(a2)da 1 da2 

(a) 1 



(b) 1 



Rr(/3i,p2)Rt(ai, a2)i>t,n 1 {ai)ipt,n 2 (a2)daida.2 
Rr (Pi , /SaJV't.m («i)^t,n 2 (ai)dai 



= i2r(/3i,/Sa)-in 1 ,« a , (34) 

where (a) is from the definition of the autocorrelation function d 1 3 1 > and (b) and (c) are by the properties of PSWFs in Theorem [2] 

Now, associated with the receive ACF R r (Pi, P2), we define {ipr,n(P)} and {A r ,„} as the sets of eigenfunctions and eigenvalues of 
7n r (Rr), respectively. Then, ipr,n{P) are PSWFs from l l 1 6t and by Theorem|2] Since h n (/3) has an ACF of 7i r (jSi , ^2 ) as shown in ( 134b . 
it can be expanded by Theorem Q] as 

00 

M/3) = ft m,nVv,m(£), P 6 fir, (35) 

where 

ftm,„ = ~t~ / h n (P)ipr, m (P)dp. (36) 

Also, by Corollary Q] ft m ,„ are zero-mean i.i.d. circular symmetric Gaussian random variables, i.e., 

E{h m , n } = (37) 

and 

E{h (38) 

Therefore, d3U and J35| l yield an expansion 

a) = Y, Y, h m ,n4>r,m(P)i>t,n(a), (P, a) £ x fit- (39) 

m = n— 

Note that the expansion l |39t is intrinsic to the channel and does not depend on the transmit and receive arrays. The basis functions ipt,n{a) 
and ip r , n (P) form the transmit and receive subspaces, respectively, along which the signal can be transmitted. However, as it will be seen 
in the following section, it depends on the arrays how those subspaces are exploited. 

IV. Capacity bound for the colored scattering channel 

In this section, an upper bound for the ergodic capacity of the colored scattering channel is presented as a function of the array length, the 
channel correlation width, the total angular spread of scattering clusters, and the signal-to-noise ratio (SNR). Before that, we first consider 
a useful lemma regarding the relationship between two different sets of PSWFs. 

Lemma 1: Consider a compact interval (or finite union of disjoint compact intervals) A and bandwidths W\ and W2, where Wi > W2 > 0. 
Let {ipn(t)} and {4>n(t)} be Va,W\ and Va,W 2 ! respectively. The corresponding sets of eigenvalues for {tp n (t)} and {4> n (t)} are also 
denoted by {A n } and {7™}, respectively. Then there exist real coefficients Cm,n (m,n = 0, 1, 2, . . .) such that 

00 

<t>m(t) = ^ Cm.nVViW (40) 
n = 

00 

Cn(t) = Xn ^ Cm,n<l>m(t), (41) 
m=0 

Cn(t)= / Wasinc (W 2 (t-s))ip n (s)ds. (42) 

J A 

oo 

(43) 



and 



where 



In addition, the coefficients c m , n satisfies 



and 

oc 

(44) 

n=0 

Proof: Note that Bw 2 Q Bw t since W% < W\. Therefore, by property 2 of Theorem [2] we can decompose cj> m (t) in terms of the basis 
functions t[>n{t) as ( |40| >, where 

Cm,r> = T— / 4>m(t)tpn(t)dt. (45) 
An J A 

Since both (j) m [t) and ip n (t) are real functions, the coefficients c m , n are also real. Then, from the orthogonality of <f) m (t), we have 



x 



r 

J — c 



4> mi (t)<j) m2 (t)d,t — 5 mi ,m 2 

oo 

= ^ ^ Cm 1 ,nCm2,?t] (46) 
n = 

where the second equality follows by using j40\ and the orthonormality of il> n (t) on the real line. That is, the sequences [c m _o, c mi i, c m> 2, ■ • ■] 
are orthonormal for different m. In addition, we have 



4>m 1 {t)4>m 2 (t)dt = 7 mi S mi ,m 2 

oo 

(47) 

n=0 

where the second equality follows by using l |40| > and the orthogonality of ip n (t) on A. Now, from J42fr . note that are the ,4-timelimited 
and VWbandlimited versions of tp„(t). Then, since ( n (t) are W-Vbandlimited, they can be expressed as 

oo 

Cn(t) = J2 Cn,m4>m(t), (48) 
m=0 

where 

Cn.m = / C,n{t)(j) m {t)dt 

7™ J.A 

- — f [ W 2 sine (W 2 (* - s)) ^ n (s)ds<j> m (t)dt 

7m 

(b) 



— W 2 sinc (W 2 (s - t)) 4> m {t)dtij) n (s)ds 

7m ./.A i^l 



= / 4> m {s)ll>n{s)ds 
J A 

= A„c ra ,„, (49) 

where (a) is from d42t . (b) is by changing the order of integrations and by the even symmetry of the sine function, (c) is by property 5 of 
Theorem [2] and (d) is from ( 145 l l. ■ 
In the following subsections, discrete subchannel decompositions of the transmit and receive signals are discussed. Along with the spectral 
decomposition of the scattering response ( 1391 ), an equivalent canonical representation of the relationship in $6$ is obtained in the angular 
domain and the upper bound for the ergodic capacity of the canonical model is investigated. 

A. Subchannel decomposition 

We suppose that the transmit ACF R t (a\,ot2) and fi t are known at the transmitter, and the receiver has full channel state information. 
In particular, for the sake of simplicity, we focus on the symmetric case in which we assume L t = L T — L, Ft = IV = T, M t = M r , and 
fit | = | fir | = where \Q\ = j n da. Since the subchannel decomposition depends on the array length L and the correlation width T, we 
first tackle the case with L < 1/T, and then the case with L > 1/T afterward. 

1) Case 1: L < 1/17 Denote the eigenfunction and eigenvalue sets of 7n t (Q), where Q(at\, a 2 ) = Z/sinc (L{pt\ — 012)), by {4>t,n(a)} 
and {jt,n}, respectively. By Theorem [2] the eigenfunctions form a PSWF set "Pn t ,L, which is a basis for Bl- Therefore, without loss of 
generality, any transmit signals from the array of length L can be expressed in the angular domain as 

00 

X(a) = £jf n ^, n (a), (50) 

n = 

where 

X n = — / X(a)<p t , n (a)da. (51) 
7t,n Jn t 

By Parseval's theorem, the transmit power constraint © can be rewritten as 

/ |A»| 2 da =^>{|X„| 2 } 

J -°° > n=0 

< P- (52) 

Analogously, we denote the eigenfunction and eigenvalue sets of Tn r (Q) by {</> r ,m(/3)} and {7r,m}, respectively. By Theorem[2] {(j>r,m(l3)} 
is Vn r ,L and forms a basis for Bl- Therefore, any received signals of the array of length L can be represented in the angular domain by 



Y (P) = Y ™<f>r,™(P), (53) 
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where 

Y m = — I Y{P)(j> r , m {P)dp. (54) 

7r,ra Jn r 

In particular, the noise Z(/3) embedded in Y(f3) is decomposed as 



m=0 

where 



Z(/3) = Z m <t>r, m {P), (55) 

— / z{p)<t> rsm {p)dp. 



7r, 

Then, it can easily be shown that Z m are i.i.d. circular symmetric Gaussian random variables with zero-mean and variance of cr 2 

E{Z mi Z^ 2 }= / / £{^(/3i)2'(/? 2 )}^, mi (/?i)^, m2 (/3 2 )d/3id/3 2 

7i-,mi7i-,m 2 J Cl r J Cl r 



(a) a 2 



(b) a 



7r,m 1 7'-,«i2 Jn r Jn r 

2 

4>r, mi (/3l)0r,m 2 {Pl)dPl 



(56) 



= cr o mi ,m 2 , (5/) 

where (a) is by the definition of Z(f3) in j9j, and (b) and (c) are by the properties of PSWFs in Theorem [2] 
Now, we substitute ( 139b into ^ and obtain 

r. r. CO OO /"OC 

Y(P)= / Q(l3,fi')^2^2h m ,ni>r,m{fi')^,n{a') Q(a' , a)X (a)dada' d/3' + Z (f3) 

o° o° ^oo 

= ^^h m , n C, r , m (P) Ct, n (a)X(a)da + Z{P), (58) 

™_n„ — n ■' — oo 



where 

C*,n(a) = / Q{a', a)ij)t, n {a)da , (59) 

Cr,m(P)= [ Q(P,P')i>r,m(P'W'- (60) 

From the assumption that L < 1/T and by Lemma [T] ( |59b and J60b can be expressed as 

OO 

(t,n(a) = A t ,„ ^ C k ,n<t>t,k{a), (61) 

oo 

Cr,m(^) = K,m ^ ,m <^r,! (/?) , (62) 
i=0 

for some real coefficients Ck, n and d;, m . Plugging ( 150b . ( 155b . ( 161b . and ([62} into ( 158b and using ( 154b and ( 156b . we have a discrete canonical 
representation of the received signal 

oo oo OO 
Yl — dl im X rtm h m]n Xt,n ^2 Ck,nXk + Zl, I = 0,1,2, ... . (63) 

m=0 n=0 fe=0 

2) Case 2: L > 1/T: As discussed in the previous case, any transmit signals from the array of length L can be expressed in terms 
of basis functions <j>t, n {ot) as ( 150b , From the spectral decomposition of the scattering response (39), the subspace in the angular domain, 
through which signals can be conveyed from the transmitter to the receiver, is spanned by {i^t,n(a)} on fl t . By extending the support of 
the basis functions from Sit to the real line and by Theorem [2] we see that any signals that can be conveyed through the channel can be 
interpolated to signals in Bi/r- In other words, X(a) € £>i/r is a necessary condition for signals not to be nulled by the channel. On the 
other hand, the subspace spanned by {4>t,n(ot)} on the real line is Bl, which includes Bi/r when L > 1/T. Therefore, it is sufficient for 
the transmitter to use {^>t,n{ot)} as the basis for the transmit signal instead of {<f>t,n(oi)}. The transmit signal is now given by 

oo 

X(a) = £x n ^t,»(a), (64) 

where 

X n = -L / X(a)^ t , n (a)da. (65) 
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\A\W 

Fig. 3. Distribution of eigenvalues of T^(R) in Theorem[3] 



Ga,w(x) 



Note that the same power constraint as d52t is applied in this case. Using d39b and d64t in {6](, we have 

Y{fi) = J2 I>™," / Q{P,P')i>r, m {P'WKnX n + Z(P), (66) 
m=0 n=0 

where the fact that X(a) in d64t is 1/r-bandlimited, i.e., Q(a' , a)X(a)da — X(a'), is used. By Lemma[T] when L > 1/r, we can 



write 



ipr,m(J3) = ^2 d„ 



(67) 



for some real coefficients dm : i. Therefore 



Q(P,P')ip r>m (p')dp' =^2jr,id m ,i<j> r ,i(P). (68) 

,n r 1=0 

As in the previous case, any received signals from the array of length L can be expressed as d53t . Using d66t and d68t in d53b and d54t . we 
obtain 



Yl = r fr,ldm,lh m . 1 „Xt,nXn + Zt, I = 0, 1, 2, . . . , 

m= n— 

where are i.i.d. circular symmetric Gaussian random variables with zero-mean and variance of a 2 as denned in d56| l. 



(69) 



B. capacity upper bound 

The capacity upper bounds for the canonical representations d63t and d69t are investigated. For the characterization of the capacity, the 
distributions of eigenvalues \ t , n or -y t , n, and A r . m or "/ r ,m are essential, since they measure the gains of transmit and receive subchannels. 
In the earlier work by Landau and Widom [24], the asymptotic distribution of those eigenvalues was found as in the following theorem. 

Theorem 3 (Landau and Widom [24]): Let A be a union of M (< oo) disjoint compact intervals. Consider a kernel on A 

R(t, s) = W sine (W(t - a)) , (t, s) £ A 2 , 



where W > 0. Then the number of eigenvalues of Ta(R) that exceed x (0 < x < 1) is given by 



G A ,w(x) = \A\W + 



M l-x 
-— In 



In (2tt \A\ W) + o (In (\A\ W)) , \A\ W -¥ oo, 



(70) 



(71) 



7T Z X 

where \A\ = j A dt. 

The distribution of the eigenvalues is graphically shown in Fig. [3] 

Now we first consider the upper bound for the capacity when L < 1/r. For notational simplicity, let us consider an infinite-dimensional 
vector space representation of d63t as 



y = DA r HA,Cx + z, 



(72) 



where y = [Yi, Y 2 , . . .] T , x = [Xi,X a , ■ ■ -f , and z = [Z 1; Z 2 



Also, D, H, and C are matrices whose (m, n)-th elements are given 



by d m>n , h m ,n, and c m ,„, respectively, and A r and At are diagonal matrices whose n-th diagonal elements are A r , n and At,n. respectively. 
It is well-known that the capacity of the system d72t is achieved by a zero-mean Gaussian input x and given by 



C = max E -j log det 

Tr(Q)< P 



1+ ^-DA,HAtC T QCAtH H A r D T 



(73) 



where Q = E {xx H }. Note that, due to the infinite dimensionality of the model, the determinant in d73t should be carefully defined in 
the limiting sense as in (5j, 1151 , 1161 . That is, we first assume that only finite numbers of dimensions of x and y in d72t are used, which 
gives the capacity representation d73l l, and then the numbers of dimensions are taken to infinity. Letting x = C T x, we can see in d72t that 
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x — x — y forms a Markov chain. Also, due to the orthonormality of the coefficients c m , n (Lemma[T|l, x = [Xo,Xi, . . .] T satisfies the 
same power constraint: 



e s {i^i 2 }=emi^i 2 }< p - 



Therefore, we have the following upper bound for the capacity: 



C < max E <, log det 

Tr(<5)<J> 



1+ ^-DA r HAtQA t H ff A r D 2 



(74) 



(75) 



where Q = E {xx H }. In it was shown that the optimal input covariance matrix to achieve the ergodic capacity is a scaled identity 
matrix when the channel matrix has i.i.d. entries. Though this is not the case in d75K it was shown in |28|-|30| that a diagonal input 
covariance matrix achieves the ergodic capacity if any two column vectors of the channel matrix are independent and the entries have a 
symmetric distribution around zero. As a result, without loss of generality, we can let Q in d75b be a diagonal matrix with _E{|X„| 2 } on its 
diagonal. Based on the fact that logdet(-) is concave, Jensen's inequality is applied to | |75I > giving another upper bound 



C < max log det 

Tr(Q)<P 



(a) 



max log det 

T.^ =0 E{\X n \?}<P 



I + ^DA r £ |HA t QA t H H } A r D T | 

I + ^ -daJd 



(ft) 

< log det 



I 



4da?d t 



(76) 



where (a) follows from the fact that Q and A t are diagonal and h m ,n are i.i.d. random variables with zero-mean and unit variance, and (b) 
from the fact that < Xt, n < 1. By further applying Hadamard's inequality to l !76t , we obtain 



c<^io g i+-^o: 



2 

l,m 



(a) - / p 

< ^ log 1 + — ^ A r , m di,„ 



(b) 



E log ( 1 + 



(77) 



where (a) is from the fact that < \ r ,m < 1 and (b) is by Lemma Q] 

Now, we turn to the upper bound for L > 1/T case. Analogously to the previous case, the infinite-dimensional vector space representation 
of l |69t is given by 

y = A,DHA t x + z, (78) 

where D is a matrix whose (m, n)-th element is d m ,„, A,- is a diagonal matrix whose n-th diagonal element is f r ,n, and the other variables 
are defined as in l !72t . The capacity of the system d78b is then characterized as 



C = max E i log det 

Q: 

Tr(Q)<-P 



1+ — A r DHAtQA t H H D T A r 



(79) 
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We apply the same bounding procedure as in d76t to l |79l l, which yields 



r p - -t - - 

C < log det I + — -A r D DA, 

( =' log det I + 4-DA^D T 

a 1 

where (a) is by Sylvester's determinant theorem. Applying Hadamard's inequality to d80t . we have 



(80) 



m=0 



J2 
m. . / 



1 = 1.) 



m=0 \ ; = o 



j2 

m,l 



- A /• 



(81) 



where (a) is from the fact that < j r ,l < 1 and (b) is by Lemma [T] Note that the upper bounds ( 1771 ) and ( 181b are determined by the 
distribution of eigenvalues j r> i and A r , m . Whether L < 1/F or L > 1/r, the number of eigenvalues 7 n j or A r>m exceeding a; (0 < x < 1) 
can collectively be quantified by Theorem [3] as 



G(x) 



._. A M . 1 — ; 

fi A + — In 

7H a; 



■ In (2tt A) + o (In (|fi| A)) , A 



where A = min{L, 1/r}. Define 



r>t \ M l-. 
G{x) = -In 

1T A X 



■ In (27r |f2| A) 



such that G(x) = —G(x) + \Q\ A + o (In (|f2| A)). Also, a parameter e is determined to satisfy 



M 

n A + — In- 



• In (27r |fi| A) = 0. 



(82) 



(83) 



(84) 



7T" 1 — e 

Then, as shown in Fig. [4] 1 — e approximately represents the largest eigenvalue. Using the collective distribution of eigenvalues l !82t , the 
upper bounds in l !77t and J8U can be written in a single expression as 

log ( 1 + dG(x) + o (In A)) , A ->■ oo. (85) 



C < 



The same form of integration on the right-hand-side of d85t has already been dealt with in 1161 Lemma 3.4], and {85} can be compactly 
reformulated as 



C < 



Ifll A + In (27r If2| A) / — 

(J 1 



+ o(ln(|Q| A)) , |f2| A 



where 



/ 



2tt ; 



■ m -— + o In -— 



P_ 



(86) 



(87) 



One interesting point to observe in iS6\ is that the pre-log factor, i.e., the number of degrees of freedom, is dependent on the SNR. In 
particular, the dependence becomes prominent when |fi|A is small. When arrays with discrete and finite number of antennas are considered, 
such dependence on SNR is not observed in general. In fact, the SNR-dependence of the number of degrees of freedom has already been 
pointed out multiple times in the literature considering continuous arrays |7|, |8|, |16|. It can intuitively be explained as follows. As shown 
in ( 177b and jX 1 [ i, the eigenvalues of the array or the channel represent gains of parallel channels. Therefore, assuming unit signal power, 
P = 1, the number of eigenvalues exceeding the noise level will determine the number of usabl^] channels. By Theorem [3] about |fi|A 
eigenvalues are close to 1 and then plunge near zero in a transition region of width around M In(27r[fi| A)/tt . With reference to Fig. [4] if 
the transition of eigenvalues from ~ 1 to ~ is not abrupt enough compared to |fi|A, i.e., for a small |fi|A value, the number of usable 
channels will noticeably depend on the noise level. 

The result of this section is summarized in the following theorem. 

Theorem 4: For the channel under colored scattering with correlation width Y and total angular spread |fi|, the ergodic capacity achieved 
by linear transmit and receive arrays of length L at a given SNR PI a 2 is bounded by 

c < |n|Aio g ( i + 4) +o(|n|A) 



(88) 

V cr ~ J 

as |n|A — > oo, where A = min{L, 1/r}. 

Interestingly, and as somewhat expected, the result in Theorem [4] coincides with the result in [16, Theorem 3.5] when L < 1/r. That is, 
at a given SNR, the capacity may almost linearly scale in proportion to the array length L. However, in the colored scattering environment, 
the capacity gets saturated to an intrinsic limit posed by the channel. 



s This means that the eigenvalues above the noise level have dominant contributions to the overall capacity. It does not imply that the capacity vanishes 
when <7 2 > 1 because no channel is usable. 
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V. Numerical results 

To validate the theoretic result provided in Section ITVl some simulations are performed. Since the continuous scattering and continuous 
array models in Section [XT] are not convenient for numerical simulation, we consider the discrete scattering and discrete array models as in 
II II . Assuming linear arrays with 2L + 1 discrete, half-wavelength-spaced antennas (indexed from — L to L), the discretized version of the 
channel response between the ra-th receive and n-th transmit antennas is given by 



c m ,n = r\ y~] \~] a*(k,m)h(k,l)at(l,n),m,n- 

k=-K l=-K 

for some nonnegative integer K, where r\ is a scaling factor and 



(89) 



o t (/,n) = e~ i2ir Ti:^, I = -K,. 
a r (k,m) =e~ j2n -K^, k = -K, 



,K, n = —L, . . 

,K, m = —L, 



(90) 
(91) 



In addition, to capture the correlation between scattered fields, the discrete scattering response h(k, I) is randomly generated to have a 
circular symmetric Gaussian distribution with zero-mean and an ACF 

' k\ — &2 \ . / h — li " 



E 



{h(k 1 ,l 1 )h*(k 2 ,l 2 )} 



1 



AT 



KY 



for 



ki h\ (ki h \ „2 
K' K '\K' K £ 



-1.0, -0.7] U 



(92) 



(93) 



More specifically, we choose K = 2 11 , L = 0, 1, . . . , 49, and fi 
Also, the scaling factor rj in d89b is determined such that Ylm=-L' Yln=-L' ^ {I' 
discrete antenna signal model is given by 

y = Hx + z, 



-0.15,0.15] U [0.7, 1.0], i.e., M = 3 and |fi| = 0.9. 
| 2 } ){2L' + l) 2 = 1 for L' = 49. Then, the resultant 

(94) 



where x and y are (2L+1) x 1 transmitted and received vectors, z is a (2Z/+1) x 1 noise vector comprised of i.i.d. circular symmetric Gaussian 
elements with zero-mean and variance of a' 2 , and H is a (2L + 1) x (2L + 1) channel matrix whose (m,n)-th element (m,n = —L, . . . , L) 
is Cm,n- According to the assumption in Section HV-AI the transmitter only knows the distribution of the channel; correlation between entries 
of H. Based on this information and the transmit power constraint P, the optimal input covariance matrix Q = E {xx H } is determined to 
achieve the ergodic capacity. However, determining the optimal input covariance matrix and computing the ergodic capacity of the correlated 
MIMO channel is cumbersome, because there is no closed form expression and, thus, numerical optimization is required |29], 1 30 1 . Therefore, 
as alternatives, we evaluate two performance metrics in [5], which serve as the lower and upper bounds for the ergodic capacity, respectively: 
the first one is the average mutual information with equal power allocation, given by 

P 



hh+i — E < logdet 



1 + 



-HH 



a 2 {2L + 1) 

and the second one is the average capacity with full channel state information at both transmitter and receiver, given by 



(95) 



C2ZH 



E • 



max log det 

Q: 

Tr(Q)<f 



I + -!-HQH H 



(96) 



The values of /2L+1 and C2L+1 are computed through Monte Carlo simulation with 10 4 trials, and the curves of 72l+i and C2L+1, 
normalized by Co = log(l + P/a ), are plotted in Fig. [5] for different values of F and different SNRs. Note that, since the inter-antenna 
spacing is fixed to half a wavelength, the sizes of array apertures grow with the number of antennas. In the figure, we can observe that the 
growth rates of I2L+1 and C2L+1 with respect to the number of antennas (equivalently, array aperture size) are severely affected by the 
correlation width V. It is seen that they do not linearly increase with the number of antennas and, in particular, the upper bound C2L+1 
eventually gets saturated to a certain value; the larger the correlation width, to the smaller value the upper bound gets saturated. On the other 
hand, the lower bound /2L+1 increases with the number of antennas up to a certain point and then gradually decreases. This is because the 
available degrees of freedom are not efficiently used, while waterfilling is used for the upper bound to make the best use of the degrees 
of freedom. However, the gap between the upper and lower bounds narrows as the SNR increases. In summary, the results in Fig. [5] verify 
the statement in Section ITVl in the colored scattering environment, the ergodic capacity may not linearly grow with the array size but gets 
saturated to a certain limit. 



VI. Discussion 

A. Single-bounce diffuse scattering 

In the analyses in Sections IIII-BI and IIVI only the multi-bounce diffuse channel has been considered. As noted in Section |Il] single- and 
multi-bounce diffuse channels share the same ACF U3\ but have different supports. Both in the single- and multi-bounce cases, the ACF can 
be expressed as a product of the transmit and receive ACFs as in dl4t . However, in the single-bounce case, the transmit and receive ACFs 
are not completely separable in the sense that the supports of those two functions are intertwined. If we consider a single scattering cluster 
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Fig. 5. Bounds for the ergodic capacity of the colored scattering channel; linear arrays with half- wavelength- spaced antennas are assumed. 



at once and ignore the other clusters, the transmit and receive ACFs are separable; the spectral decomposition of the scattering response in 
the pair of transmit and receive subintervals corresponding to each cluster is given by 



h(/3,a) h ( £ n 4:Ui3)4%(a), 6 a* x nt, ( ,t : 



1,2, 



(97) 



where hm,n are i.i.d. circular symmetric Gaussian random variables with zero-mean and unit variance, and 4> ( t %(a) and ^L(/3) are PSWFs 
'Psiti^/r an d T>n r -,i/r> respectively. Assuming L < 1/r, the transmit and receive signal can be written similarly to the multi-bounce 
counterpart as 



M 00 



X(a) 



and 



i = l n = Q 
M 00 

= E E *tfV£mG8). 



(98) 



(99) 



i=l m=0 



where ^'^(a) and </jf'm(/3) are PSWFs Pn t tj £ and Vn ri ,L, respectively. In 1241 . 1261 . the asymptotic orthogonality of the PSWFs with 
disjoint time or frequency supports was addressed. That is, for i\ 12, ipln i a ) an d ' ( a ) 316 asymptotically orthogonal either on 
Q,t,i t or on Qt,i 2 for a small correlation width F (large 1/r). The same asymptotic orthogonality can be stated for ipl]ln(f3), <7j^(a), and 
<j>r}m{fi) (large L). Due to this asymptotic orthogonality, the single-bounce diffuse channel is asymptotically equivalent to M independent 
single-cluster multi-bounce diffuse channels (the same argument was made in 1161 ). Therefore, after some straightforward manipulation on 
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J86l >. we have the capacity upper bound for the single-bounce diffuse channel as 

M 



c < 



A + ^ln(27r|fii| A)/ 



Ma' 



log 1 + 



Ma 2 



+ o(ln(|fi| A)), |fii|A^oo, 



(100) 



where it is assumed that |fit,t| = \£l r ,i\ = |^| and equal power is allocated for each cluster Q2^=o E{\Xn | 2 } < P/M, i = 1, . . . , M). 
Comparing J86t and l llOOb . it is seen that the single- and multi-bounce diffuse channels have almost the same number of degrees of freedom 
for large |fi|A. 



B. Diversity gain 

As for the diversity gain of the slow-fading colored scattering channel, the same argument for the white scattering channel in 1161 can 
be revisited. Though the multi- and single-bounce diffuse channels have almost the same degrees of freedom, their diversity gains and the 
tradeoffs between the diversity and multiplexing gains are different. In particular, one thing to note for colored scattering is that, as in 
the case of degrees of freedom, the maximum diversity gain is limited by the correlation characteristic of the channel regardless of the 
lengths of arrays. By the same argument in 1161 , it is clear that the limits are given by |fit||fi r |/(rtrV) for the multi-bounce case and 
Efci |«t,i||ayi|/(r t IV) for the single-bounce case, respectively. 



C. Conclusion 

The correlation in the scattered fields has not been captured in conventional scattering channel models. In this paper, we introduced a 
correlated scattering model, referred to as the colored scattering model, and analyzed the impact of correlation on the channel capacity. In 
previous studies, it has been well known that the capacity and the diversity gain grow in proportion to the sizes of transmit and receive array 
apertures under the conventional white scattering model. In contrast, it was shown in this paper that the capacity and diversity gain can be 
saturated to certain values under the colored scattering model. The limits on the capacity and diversity gain are intrinsically determined by 
the correlation characteristic of the channel and, as the correlation decreases, i.e., as the model gets close to the white scattering model, the 
limits increase indefinitely. That is, the colored scattering model includes the conventional white scattering model as the limiting case. The 
result of this paper yields useful insight into the interaction between the channel and antenna arrays and is applicable in various situations 
concerning very large arrays. For example, as the larger and larger antenna arrays are being taken into consideration in modern and future 
communication systems to attain high spectral efficiency and reliability, the result of this paper can provide a guideline for the selection of 
the maximum aperture size with the given correlation characteristic of the scattering environment. 
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